Correspondence Learning for Deep Multi-Modal Recognition and Fraud Detection
نویسندگان
چکیده
Deep learning-based methods have achieved good performance in various recognition benchmarks mostly by utilizing single modalities. As different modalities contain complementary information to each other, multi-modal based are proposed implicitly utilize them. In this paper, we propose a simple technique, called correspondence learning (CL), which explicitly learns the relationship among multiple The data samples randomly mixed samples. If from same sample (not mixed), then they positive correspondence, and vice versa. CL is an auxiliary task for model predict expected extract modality check achieve better representations tasks. work, first validate method including CMU Multimodal Opinion-Level Sentiment Intensity (CMU-MOSI) Opinion Emotion (CMU-MOSEI) sentiment analysis datasets. addition, fraud detection using learned To additional usage, collect dataset real-world reverse vending machines.
منابع مشابه
Deep Multi-Modal Image Correspondence Learning
Inference of correspondences between images from different modalities is an extremely important perceptual ability that enables humans to understand and recognize crossmodal concepts. In this paper, we consider an instance of this problem that involves matching photographs of building interiors with their corresponding floorplan. This is a particularly challenging problem because a floorplan, a...
متن کاملMulti-modal Deep Learning Approach for Flood Detection
In this paper we propose a multi-modal deep learning approach to detect floods in social media posts. Social media posts normally contain somemetadata and/or visual information, therefore in order to detect the floods we use this information. The model is based on a Convolutional Neural Network which extracts the visual features and a bidirectional Long Short-Term Memory network to extract the ...
متن کاملModal Matching for Correspondence and Recognition
--Modal matching is a new method for establishing correspondences and computing canonical descriptions. The method is based on the idea of describing objects in terms of generalized symu&ries, as defined by each object’s eigenmodes. The resulting modal description is used for object recognition and categorization, where shape simihuities are expressed as the amounts of modal deformation energy ...
متن کاملMulti-Modal Multi-Task Deep Learning for Autonomous Driving
Several deep learning approaches have been applied to the autonomous driving task, many employing end-toend deep neural networks. Autonomous driving is complex, utilizing multiple behavioral modalities ranging from lane changing to turning and stopping. However, most existing approaches do not factor in the different behavioral modalities of the driving task into the training strategy. This pap...
متن کاملCorrelated and Individual Multi-Modal Deep Learning for RGB-D Object Recognition
In this paper, we propose a correlated and individual multi-modal deep learning (CIMDL) method for RGB-D object recognition. Unlike most conventional RGB-D object recognition methods which extract features from the RGB and depth channels individually, our CIMDL jointly learns feature representations from raw RGB-D data with a pair of deep neural networks, so that the sharable and modalspecific ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2021
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics10070800